Fix panic when pending pipelinerun is failed #4298

ghost · 2021-10-11T17:34:07Z

Changes

PipelineRuns that are created with status PipelineRunPending
can be placed into a failed state before their execution begins.
For example: a third-party controller may be watching for pending
PipelineRuns to perform some checks on them prior to execution
beginning. If those checks fail the controller might choose to
set the PipelineRun status to failed with a relevant error message
indicating which check failed and why.

Prior to this commit when a pending PR failed our metrics code
could panic because the PR's StartTime is nil.

This commit adds a guard to the metrics code to ensure that StartTime
is not nil before computing the PR's duration. If it is nil then
we assume the duration is 0. A unit test confirming this behaviour
has been added as well.

/kind bug

Submitter Checklist

As the author of this PR, please check off the items in this checklist:

Tests included if any functionality added or changed
Follows the commit message standard
Meets the Tekton contributor standards (including
functionality, content, code)
Release notes block below has been filled in or deleted (only if no user facing changes)

Release Notes

Fixed an issue where the PipelineRun reconciler could panic if a PipelineRun with spec.status set to PipelineRunPending was placed into a failed state before execution was able to begin.

PipelineRuns that are created with status PipelineRunPending can be placed into a failed state before their execution begins. For example: a third-party controller may be watching for pending PipelineRuns to perform some checks on them prior to execution beginning. If those checks fail the controller might choose to set the PipelineRun status to failed with a relevant error message indicating which check failed and why. Prior to this commit when a pending PR failed our metrics code could panic because the PR's StartTime is nil. This commit adds a guard to the metrics code to ensure that StartTime is not nil before computing the PR's duration. If it is nil then we assume the duration is 0. A unit test confirming this behaviour has been added as well.

tekton-robot · 2021-10-11T17:37:23Z

The following is the coverage report on the affected files.
Say /test pull-tekton-pipeline-go-coverage to re-run this coverage report

File	Old Coverage	New Coverage	Delta
pkg/pipelinerunmetrics/metrics.go	81.6%	82.0%	0.4

ghost · 2021-10-11T17:59:54Z

/test pull-tekton-pipeline-alpha-integration-tests

ghost · 2021-10-11T18:55:09Z

/test pull-tekton-pipeline-alpha-integration-tests

pritidesai

thanks @sbwsg for this fix 👍

tekton-robot · 2021-10-11T21:51:26Z

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: pritidesai

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

~~OWNERS~~ [pritidesai]

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

pritidesai · 2021-10-11T21:52:23Z

read tcp 10.28.0.18:39812->104.18.122.25:443: read: connection reset by peer

/test pull-tekton-pipeline-alpha-integration-tests

ghost · 2021-10-12T13:00:24Z

/test pull-tekton-pipeline-alpha-integration-tests

bobcatfish · 2021-10-12T21:26:43Z

ooo yikes nice fix!

/lgtm

(side note, apparently we only have 2 test flake issues - im wondering if this PR is a fluke or if we're not recording the flakes that are occurring 🤔 )

pritidesai · 2021-10-12T21:37:50Z

ooo yikes nice fix!

/lgtm

(side note, apparently we only have 2 test flake issues - im wondering if this PR is a fluke or if we're not recording the flakes that are occurring 🤔 )

I have seen multiple flakes in the past few days, but haven't recorded any 😞 including, #4281 (comment), #4286 (comment), and many more

bobcatfish · 2021-10-13T21:04:21Z

@pritidesai i wonder if it would be helpful if we made a template for creating flake bugs specifically to make it easier to record them 🤔

pritidesai · 2021-11-16T18:07:22Z

@pritidesai i wonder if it would be helpful if we made a template for creating flake bugs specifically to make it easier to record them 🤔

sorry @bobcatfish for delayed response, just saw this.

Definitely, I think it will be useful to create such template. I am not sure if its possible but when prow reports integration test failure, if we can attach such template so that the PR author can create a new issue.

tekton-robot added release-note Denotes a PR that will be considered when it comes time to generate release notes. kind/bug Categorizes issue or PR as related to a bug. labels Oct 11, 2021

tekton-robot requested review from dibyom and vdemeester October 11, 2021 17:34

tekton-robot added the size/M Denotes a PR that changes 30-99 lines, ignoring generated files. label Oct 11, 2021

pritidesai approved these changes Oct 11, 2021

View reviewed changes

tekton-robot added the approved Indicates a PR has been approved by an approver from all required OWNERS files. label Oct 11, 2021

tekton-robot assigned bobcatfish Oct 12, 2021

tekton-robot added the lgtm Indicates that a PR is ready to be merged. label Oct 12, 2021

tekton-robot merged commit 0299c6c into tektoncd:main Oct 12, 2021

pritidesai added the needs-cherry-pick Indicates a PR needs to be cherry-pick to a release branch label Oct 13, 2021

pritidesai mentioned this pull request Oct 13, 2021

Fix panic when pending pipelinerun is failed #4306

Merged

5 tasks

ghost removed the needs-cherry-pick Indicates a PR needs to be cherry-pick to a release branch label Jan 26, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix panic when pending pipelinerun is failed #4298

Fix panic when pending pipelinerun is failed #4298

ghost commented Oct 11, 2021

tekton-robot commented Oct 11, 2021

ghost commented Oct 11, 2021

ghost commented Oct 11, 2021

pritidesai left a comment

tekton-robot commented Oct 11, 2021

pritidesai commented Oct 11, 2021

ghost commented Oct 12, 2021

bobcatfish commented Oct 12, 2021

pritidesai commented Oct 12, 2021

bobcatfish commented Oct 13, 2021

pritidesai commented Nov 16, 2021

Fix panic when pending pipelinerun is failed #4298

Fix panic when pending pipelinerun is failed #4298

Conversation

ghost commented Oct 11, 2021

Changes

Submitter Checklist

Release Notes

tekton-robot commented Oct 11, 2021

ghost commented Oct 11, 2021

ghost commented Oct 11, 2021

pritidesai left a comment

Choose a reason for hiding this comment

tekton-robot commented Oct 11, 2021

pritidesai commented Oct 11, 2021

ghost commented Oct 12, 2021

bobcatfish commented Oct 12, 2021

pritidesai commented Oct 12, 2021

bobcatfish commented Oct 13, 2021

pritidesai commented Nov 16, 2021